Recognition of read and spontaneous children's speech using two new corpora

Authors

  • Martin J. Russell
  • Shona D'Arcy
  • Lit Ping Wong
Abstract

This paper describes some of the results of research into automatic recognition of children’s speech which has been conducted as part of the European Framework 5 ‘PF STAR’ project. Two new corpora of British English children’s speech are described. The first comprises over 14 hours of read data from 159 children, while the second includes 1 hour and 23 minutes of spontaneous and emotional speech from 30 children. A partition of the data into training, evaluation and test sets is proposed, and the results of ‘baseline’ speech recognition experiments are presented. The results fail to demonstrate a significant improvement from the use of age-dependent acoustic models, or that the emotional speech is more difficult to recognise than ‘ordinary’ spontaneous speech.

Similar resources

Syllable detection in read and spontaneous speech

Automatic syllable detection is an important task when analysing very large speech corpora in order to answer questions concerning prosody, rhythm, speech rate, speech recognition and synthesis. In this paper a new method for automatic detection of syllable nuclei is presented. Two large spoken language corpora (PhonDatII, Verbmobil) were labelled by three phoneticians and then used to adjust t...

The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data

In this paper, we describe a new corpus, named DIRHA-L2F RealCorpus, composed of typical home automation speech interactions in European Portuguese that has been recorded by the INESC-ID’s Spoken Language Systems Laboratory (L2F) to support the activities of the Distant-speech Interaction for Robust Home Applications (DIRHA) EU-funded project. The corpus is a multi-microphone and multi-room data...

On the development of matched and mismatched Italian children's speech recognition systems

While at least read speech corpora are available for Italian children’s speech research, there exist many languages which completely lack children’s speech corpora. We propose that learning statistical mappings between the adult and child acoustic space using existing adult/children corpora may provide a future direction for generating children’s models for such data deficient languages. In thi...

Automatic generation of phonetic transcriptions for large speech corpora

We describe a method for the automatic production of phonetic transcriptions in large speech corpora. First, we focus on the application of different techniques for the generation of pronunciation variants. Then, we explain the application of a speech recognition system for selecting the acoustically best matching phonetic transcription. The system is evaluated on different test sets selected f...

The Phonetic Labeling on Read and Spontaneous Discourse Corpora

Read and spontaneous discourse are two different but very significant speech styles to investigate. Phonetic labeling is therefore carried out on two corpora: ASCCD, a 10-hour read discourse corpus, and CASS, a 4-hour spontaneous discourse corpus. First the principles and conventions of transcription are presented. Then, these two speech styles are compare...

Publication date: 2004